Heuristics for the Phylogeny Problem

Authors

  • Alexandre A. Andreatta
  • Celso C. Ribeiro
Abstract

A phylogeny is a tree that relates taxonomic units, based on their similarity over a set of characters. The problem of finding a phylogeny with the minimum number of evolutionary steps is one of the main problems in comparative biology. In this work, we study different heuristic approaches to the phylogeny problem under the parsimony criterion. New algorithms based on GRASP, VNS, and tabu search metaheuristics are proposed. All heuristics are implemented and compared under the same framework, leading to consistent and thorough comparative results. Computational results are reported for benchmark instances from the literature.

1. Phylogeny Problem

A phylogeny (or an evolutionary tree) is a tree that relates groups of species, populations of distinct species, populations of the same species, or homologous genes in populations of distinct species, indistinctly denoted by taxons [3, 32, 33]. Such relations are based on the similarity over a set of characters. Tree leaves represent the taxons under analysis, while interior nodes represent hypothetical (unknown) ancestors.

Characters are independent attributes used to compare taxons. A character may represent, e.g., a morphological attribute or a position within a sequence of nucleotides. Each character takes values in a finite set of possible states. For example, if some character describes a position in a sequence of nucleotides, then the possible states are "Adenine", "Cytosine", "Guanine", and "Thymine". Each taxon is defined by its character states. Binary characters are those that have only two possible states, which represent the presence or the absence of some attribute. Instances of the phylogeny problem (with binary characters) are then characterized by 0-1 matrices, in which each element (i, j) corresponds to the state of character j within taxon i. Each row represents the characteristic vector of some taxon.

Different criteria may be used in the evaluation of phylogenetic trees. Each state change along a branch of the phylogenetic tree is counted as an evolutionary step. The parsimony criterion states that the best phylogeny is the one that can be explained by the minimum number of evolutionary steps. It is frequently said that this criterion is the best one if the probability of character mutation is very small [22, 24, 31]. Given a set of taxons defined by a set of characters, the phylogeny problem (under parsimony) consists in finding a phylogeny with the minimum number of evolutionary steps. The phylogeny problem is one of the main problems in comparative biology.
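To make the parsimony criterion concrete, the sketch below counts evolutionary steps for a fixed tree over a small 0-1 matrix. It uses Fitch's small-parsimony procedure, a standard technique that the abstract does not name, so it should be read as an illustration and not as the authors' method. The names `fitch_steps` and `parsimony_score`, the nested-tuple tree encoding, and the four-taxon matrix are all assumptions made up for this example.

```python
from typing import Dict, List, Tuple, Union

# Hypothetical encoding: a leaf is a taxon name, an interior node is a pair of subtrees.
Tree = Union[str, Tuple["Tree", "Tree"]]


def fitch_steps(tree: Tree, states: Dict[str, int]) -> int:
    """Count the state changes needed for one character on a rooted binary tree."""

    def visit(node: Tree):
        if isinstance(node, str):
            # A leaf carries the observed state of its taxon and costs nothing.
            return {states[node]}, 0
        left, right = node
        left_set, left_steps = visit(left)
        right_set, right_steps = visit(right)
        common = left_set & right_set
        if common:
            # The children agree on at least one state: no extra step here.
            return common, left_steps + right_steps
        # The children disagree: one state change is needed below this node.
        return left_set | right_set, left_steps + right_steps + 1

    return visit(tree)[1]


def parsimony_score(tree: Tree, matrix: Dict[str, List[int]]) -> int:
    """Sum the per-character step counts (characters are treated as independent)."""
    n_chars = len(next(iter(matrix.values())))
    return sum(
        fitch_steps(tree, {taxon: row[j] for taxon, row in matrix.items()})
        for j in range(n_chars)
    )


# Illustrative instance: rows are taxons, columns are binary characters.
matrix = {
    "A": [0, 0, 1],
    "B": [0, 1, 1],
    "C": [1, 1, 0],
    "D": [1, 0, 0],
}
tree_1 = (("A", "B"), ("C", "D"))
tree_2 = (("A", "C"), ("B", "D"))

print(parsimony_score(tree_1, matrix))  # 4
print(parsimony_score(tree_2, matrix))  # 6
```

Under parsimony, `tree_1` is preferred over `tree_2` on this toy instance because it explains the same matrix with fewer steps. Scoring a given tree is the easy part; the phylogeny problem is the search over tree topologies, which is where heuristics come in.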

Similar resources

Effective heuristics and meta-heuristics for the quadratic assignment problem with tuned parameters and analytical comparisons

Quadratic assignment problem (QAP) is a well-known problem in the facility location and layout. It belongs to the NP-complete class. There are many heuristic and meta-heuristic methods, which are presented for QAP in the literature. In this paper, we applied 2-opt, greedy 2-opt, 3-opt, greedy 3-opt, and VNZ as heuristic methods and tabu search (TS), simulated annealing, and pa...


A Comparison of Four Multi-Objective Meta-Heuristics for a Capacitated Location-Routing Problem

In this paper, we study an integrated logistic system where the optimal location of depots and vehicles routing are considered simultaneously. This paper presents a new mathematical model for a multi-objective capacitated location-routing problem with a new set of objectives consisting of the summation of economic costs, summation of social risks and demand satisfaction score. A new multi-objec...


A bi-objective model for a scheduling problem of unrelated parallel batch processing machines with fuzzy parameters by two fuzzy multi-objective meta-heuristics

This paper considers a bi-objective model for a scheduling problem of unrelated parallel batch processing machines to minimize the makespan and maximum tardiness, simultaneously. Each job has a specific size and the data corresponding to its ready time, due date and processing time-dependent machine are uncertain and determined by trapezoidal fuzzy numbers. Each machine has a specific capacity,...


Relax and Fix Heuristics for Simultaneous Lot Sizing and Sequencing the Permutation Flow Shops with Sequence-Dependent Setups

This paper proposes two relax and fix heuristics for the simultaneous lot sizing and sequencing problem in permutation flow shops involving sequence-dependent setups and capacity constraints. To evaluate the effectiveness of the mentioned heuristics, two lower bounds are developed and compared against the optimal solution. The results of the heuristics are compared with the selected lower bound.


Algorithms for Imperfect Phylogeny Haplotyping (IPPH) with a Single Homoplasy or Recombination Event

The haplotype inference (HI) problem is the problem of inferring 2n haplotype pairs from n observed genotype vectors. This is a key problem that arises in studying genetic variation in populations, for example in the ongoing HapMap project [5]. In order to have a hope of finding the haplotypes that actually generated the observed genotypes, we must use some (implicit or explicit) genetic model ...


Journal:
  • J. Heuristics

Volume 8

Publication date: 2002